Supplementary Figure S1 : Schematic phylogeny of the African haplogroups found in the 982 
individuals of our dataset. The reference sequences RSRS (Behar et al. 2012) and rCRS 
(corresponding to European haplogroup H, Andrews et al. 1999) are indicated with a black line. 
Dashed lines indicate nodes with low posterior probability for which the topology could not be 
resolved. The width of the triangles is proportional to the frequency of a given haplogroup in the 
dataset. Haplogroups L3d and L3f, in yellow, are described in more detail in Figure S4Aand S4B. 
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Supplementary Figure S2: CA plots based on basic haplogroup frequencies. Color coding by linguistic 
affiliation as specified in Table 1. A: Including all haplogroups and all populations. B: Including all haplo- 
groups but excluding outlier populations. C: Excluding haplogroups LOd and LOk and excluding outliers. 
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Supplementary Figure S3: Haplogroup frequencies of important haplogroups in the 
populations studied here. A: Haplogroups LOd and LOk. B. Subhaplogroups L3d3a1 and 
L3f1b4a. The size of the circles is proportional to the sample size. 




Supplementary Figure S4A: Bayesian tree of all African complete mtDNA sequences belonging to 
haplogroup L3d, and S4B: belonging to haplogroup L3f. The sequences included in these trees 
are the same as those included in the networks in Figure 4B and D. Branches discussed in the 
main text are highlighted. The posterior probability associated with major nodes is indicated in red 
font. Note the different time scale between the two trees. 
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Supplementary Figure S5: Bayesian Skyline Plots for the Himba, Herero, Kuvale and 
Damara. X axis: time in years ago. Y axis: effective population size per generation time. 
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